Toward Widely-Available and Usable Multimodal Conversational Interfaces

Author

  • Alexander Gruenstein
Abstract

Multimodal conversational interfaces, which allow humans to interact with a computer using a combination of spoken natural language and a graphical interface, offer the potential to transform the manner by which humans communicate with computers. While researchers have developed myriad such interfaces, none have made the transition out of the laboratory and into the hands of a significant number of users. This thesis makes progress toward overcoming two intertwined barriers preventing more widespread adoption: availability and usability. Toward addressing the problem of availability, this thesis introduces a new platform for building multimodal interfaces that makes it easy to deploy them to users via the World Wide Web. One consequence of this work is City Browser, the first multimodal conversational interface made publicly available to anyone with a web browser and a microphone. City Browser serves as a proof-of-concept that significant amounts of usage data can be collected in this way, allowing a glimpse of how users interact with such interfaces outside of a laboratory environment. City Browser, in turn, has served as the primary platform for deploying and evaluating three new strategies aimed at improving usability. The most pressing usability challenge for conversational interfaces is their limited ability to accurately transcribe and understand spoken natural language. The three strategies developed in this thesis – context-sensitive language modeling, response confidence scoring, and user behavior shaping – each attack the problem from a different angle, but they are linked in that each critically integrates information from the conversational context.

Thesis Supervisor: Stephanie Seneff
Title: Principal Research Scientist

Similar resources

A Study of Speech Interfaces for the Vehicle Environment

The impact of age, gender, and technology experience on acceptance and quality of interaction was evaluated using an informational retrieval system combining manual control elements and a visual display with a naturalistic conversational speech based interface (City Browser). In addition to the technical challenges of developing useful human machine interfaces (HMIs), there is increasing recogn...

Designing a Conversational Interface for a Multimodal Smartphone Programming-by-Demonstration Agent

In this position paper, we first summarize our work on designing the conversational interface for SUGILITE – a multimodal programming by demonstration system that enables a virtual agent to learn how to handle out-of-domain commands and perform the tasks using available third-party mobile apps in task-oriented dialogs from the user’s demonstrations. We then discuss our planned future work on ena...

Eye Gaze for Reference Resolution in Multimodal Conversational Interfaces

Towards A System Of Patterns For The Design Of Multimodal Interfaces

Since R. Bolt’s seminal "Put that there" demonstrator, more and more robust and innovative modalities can be used and empirical work on the usage of multiple modalities is now available for guiding the design of efficient and usable multimodal interfaces. This paper presents a system of patterns for capitalizing and formalizing this design knowledge about multimodal interfaces as patterns. Patt...

Multimodal Human-Computer Interfaces Editors

The goal of multimodal interfaces is to extend the sensory-motor capabilities of computer systems to better match the natural communication means of human beings. Multimodal interfaces represent a very active interdisciplinary research area which has expanded rapidly. Since the seminal “Put that there” demonstrator by R. Bolt (1980) that combines speech and gesture, significant achievements hav...

Publication date: 2009